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ABSTEACT 

If ' Hinitab is a statistical computing^ system vhich uses 

simple language, produces clear output, and keeps tirack of 
bookkeeping automatically. Error checking With English diagnostic's 
and inclusion of several default options help to facilitate use of 
the system by students. Hinitab II is an improved and expanded 
version of the original Hinitab vhich was based on the Omnitab System 
of the National Bureau of Standards. With the new system students can 
analyze more extensive and realistic data sets and, as a consequence, 
can devote mora.^^tijie to the concepts of statistics rather than to 
calculations. Hinitab consists Ma worksheet (normally 200 rows tiy 
50 columns) and about 130 free format commands which provide A wide 
variety of capabilities. The system is in current use not onLy in 
academic settings but in industrial enMfonments as well. Hinitab is 
written in machine compatible FOETEAN IV and' can be implemented on a 
variety of computers. Examples of a paired t^test, a data analysis, / 
an illustration of the concept of a limi^, a simulation, and a 
polynomial r^gressi^n with data transformation are provided to show 
the ease and flexibility of Hinitab II. Output exhibits' for these 
exaimples are appended. (CH) - \ 
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1. Introduction * 

Minitab is a general purpose statistical computing system which was originally devclQt>ed |br use by 
students in introductory, pre-calculus statistics courses. It has had a great impact on teaching these courses anc 
many advanced courses a's well. When the system was originally planned^ the intent waslto allow s^udeiui tc 
analyze more extensive and more realistic data sets, and thus get a better perspective on hpw siatistic^s isreally 
done. This objective has certainly been achieved. But what we didnU realize was that the b^siq emph$si$ of piir 
teaching would change. Relieved of tedious calculations and worry about the technicalities of ,t(ta}istical 
procedures, we and the students arQ now free to concentrate on the important concepts olj' statistics — on 
exploring data and choosing good procedures to analyze the data. 

The original Minitab sy.slcm (6), ba.scd on the Omnitab System of the National Bur(5au o,lrStandalrds [2, 3, 
4, 9 J, was begun in 1971. It was used cxtcnsfVely at Penn State, both in courses and research, and was exported 
to over 6S institutions. Minitab II, which has just become available to users, is a significan tly improved and 
expanded version. ' ' * ' ^ 



Minitab can be readily kistalled, in either batch of interactive mode, on any medium to 
2. How Minitab V/orkt ^ 



large cothputer. 



Minitab consists of a worksheet (normally 200 rows by 50 columns) in which data are stored, plus a 
collection of about 1 30 pseudo English commands which operate on the data in the woi-ksheet. The easiest way 
to sit how Minitab works is to look at a simple example. 

Problem: Suppose 3 exams were given in a course. A student's final grade is his average for the 3 exams. 
Find the final grade for each student, and find the class average of the final grades. There is 6ne data card for 
each student containing his 3 exam grades. 



o 
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Minitab program: 



READ THE FOLLOWING GRADES INTO COLUMNS CI -03 



85 


82 


90 


89 


75 


82 


95 


85 


72 


100 


98 


95 


98 


90 


91 
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ADD ChC3. STORE SUMS COLUMN d4 . / 

Dl VIDE COLUMN C4 BY 3, PUT FINAL GRADES INTO C5 

PRINT THE FINAL GRADES WHICH ARE IN CS • . 

AVERAGE THE NUMBERS IN C5 

STOP V . ' f 



^Thc output is given in Exhibit 1. • 
Some basic points, ' . ^ 

(1) Minitab is an interpreter. It reads (and prints) a command card, checks for errors, executes the 
command, and thef)sg(2S§^ *t6 the next command. 

(2) The columns of the worksheet are denoted by CI, . . ., C50. In addition to the worksheet, Minitab can 
store up to 50 constants, denoted by KJ^ , . ., KSO, and 10 matrices, denoted by Ml, . . ., M 10. (At present only 
a few of the. planned matrix operations have been implemented.) >^ will use the word "argumenfyto refer 
either to a column/ a'stored constant, a matrix, or' a number. \ 

• ■ \ 

(3) On a command card, only the (first 4 letters of the) command nkme and the arguments (in the proper 
order) are used by the computer. All command and data cards arc Kee format. For example, the third 
command card in the above program, contains the command name DIWDJE ancj^arguments C4, 3, C5. We; 
could just as well have written this command as: DJVID^ C4 BY 3, PUT^ C5, (ft even DIVI C4 3 C5. A list 
of consecutive column rtumbers may be indicated by using a dash, as in the R£AD cotstmand. 

V \ - 

3. The Commands In Minitflb / 

Minitab*^ 130 comnl^'nds provides a wide variety of capabilities: including: 

Arithmetic: ADD, MULTIPLY, SQRT, MAXIMUM, COUNT, PARSUMS, etc. 

Functions:, LOGTEN, LOGE, SIN, ABSOLUTE VALUE, etc. j 

DescripHve^atlstic^: AVERAGE, MEDIAN, HISTOGRAM, CORRELATION, etc. 

Manipulations: SQRT, OMIT, CHOOSE, PICK, JOIN, etc. I * ;V ' / 

Plots: PLOT, MPLOT (multiple), TPLOT (pseudo three din^tnsional), etc. / 

Basic Statistics: TINTERVAL, TTEST, CONTINGENCY f ABLE, REGRESS (Simple, polynoitiiniaf 
and multiple); AOVONE\VAY, etc. \ 

/ ■ ' 

Data Generation: NRANDOM (normal), URANDOM (i/niform), PRANDOM <Poisson), SAMPp 
(without replacement), etc. \ 

In addition, many other capabilities not expressly built in as commands c$i) be„Teadit^ dDnc by linking 
tojgciher a few commands. For example SUBTRACT and TTEST will give a paired t-tes^N SCORES a^d 
■^liOT will do a normal probability plot, URANDOM and LOGE will give exponential random data, and so 

Input and output is usually free format, but Fortran fam^rt-stalcjticnts ^an also be used when desired. 

Other capabilities currently undcTofiv^pmcnt include (1) a very genc/al analysis of variiince capability, 
. t2) matrix operations, (3) looping, (4) lab^g variables, (5) robust staWtical procedures, (6) non-lincai** 

. ■ // . ■ 



rcizrcs^ion, (7) analysis-of multidimensional contingency tablcsVand'(8) Fortran-like algebraic statcmertls: e.g., 
COMPUTE C5 = SIN(CI/2) + C2**5. \ ^ 



4, Some Examples 

The followir^ four exampFes show the ease and flexibility of using Minitab in undergraduate courses. 

A. Paired t-tcsl. . 

Exercise. A study was done to determine the effectiveness of (iartoon vs. realistic pictures iri viJiual 
aids, A short sfide presentation containing 9 cartoon and 9 realistic slides >yas given to 34 University students. 
After the presentation, each student'took a quiz which contained one question on each/ slide. Compare the 
effectiveness of cartoon vs. realistic slides. The data has been punched into cards. There lis one card for each 
student containing his score on the 9 cartoon questions followed by his score on the 9 realistie' questions. 



Minitab does not;hdve a command to do a paired t«test. However, since a paired t-test is just a test of 
J |j = jQj)n the differences, we can use the following program; \ 

CARtOON SCORES INTO CI, REALISTIC SCORES INTO C2 
(^4 data cards go here) 
SUBTRACT C2 FROM Ch PUT DIFFERENCES INTO C3 

TTEST WITH MU «\0 ON DIFFERENCES IN C3 * 
STOP ' 'l^ 



The output is ih Exhibit X. The rfcsuUs of 'the test show that the students did oVerwhejmingly better on the 
cartoon questions than On the realistic questions, .suggesting^at cartoon pictures make learninf^ easier. 

B. Data Analysis. ' ■ , 

^kercise. The.^udents in the visual aid study of Example A were also given the OTIS Quick Scoring 
Mental Ability Test, which yielded a rough estimate of their natural ability. This score was punched as the third 
number on each student's data cSrd. What do the OTIS scores tell you about the experiment and the analysis of 
Example A? - 

' This is a fairly openendcd exercise, where a student is free to explore the data using various techniques. 
Irlerc suppose we just look at the rdlationship between the realistic scores and the OTIS scores. We usually 
pxpect a high-correlation between ability and test scores.vThe Minitab program: 



/ RHAD CARTOON SCORES INTO C\ REALISTIC SCORES INT0C2, OTIS SCORES INTO C3 

i 



OvKUi l.AI ION i vvi-r-N r2 AND n 



S I OP \^ ' ^ , • 

' finds a correlation of only 0.237. A plot often shows what's really going on in a da/a set; the command 
PLOT REALISTIC IN C2 VS. OTIS IN C3 ' 

Produced tho plot in Exhibit 3. Two things are immediately apparent — both of which tend to lower the 
correlation cocfncicnt. First there is an outlier. A check of the data shows that this same student also had a very 
low cartoon score; Perhaps he should be omitted from analysis — at least temporarily. The plot also shows a 
' truncation effect in the students' cartoon scores. Those with high OTIS scores tended to run into a ceiling on 
> thc tcsv indicating that the slide-presentation and exam was tao easy for these students. This may mean loss of 
intofrtiation. 



C. Calculus. • ^ i 

Although Minitab was designed as a statistical system, its. flexible structure cncourageit its use In other 
fields. The following example shows ho^ Minitab can illuminate the concept of a limit. 

liM'nise. Use Minitab to study the limit of (sin x)/ as x -> 0. A student miglit use tlie following program: 

GI-NI-RATri Tlin INTF.GHRS FROM 1 TQW^INTO CI 

DIVIDH 1 BY CI, PUT INTO C2 (X-VALUES) J / 
SIN OF C2. PUT INTO CI o . / 

DIVIDE C3 BY C2, PUT INTO C4 (SINX/X) 
PRINT X IN C2. SINX/X IN C4 

STOP. „ 

He could tj)ciT use the same technique to find the limit from below,, and observe that in both cpscs (sin x)fX 
convc^gifs lo U a:! x approaches 0. 




Minitab can generate dntit from most of the distributions used in statistics. This allows students to do 
Ife usual simulation exercises — coin tossing, dewing balls from urns, observitions of x, generating confide ice 
intervals, generating data from a given regression models etc. 

The following example, taken from thp Minitab Handbook [8], shows how simulation can be used to 
illustrate a fairly advanced concept — the Arc Sine Law [I] 

I.osltif4 Streaks. Gamblers, sports fans and investors irv the stock market \ll claim they have ''winring 
streaks** and "losing streaks*'. In this example, we will use simulation to shovV that these streak's may be 
m)Uu^2] more ihan chance fluctuations. Consider the following simple gambling gatne. You and your opponent 
each b^t a dollar. Then you toss a fair coin. If it comes up tails, you win one dollan If it comes up heads, you 
lose one (ioilar. Suppose you continue to play this game for a long time. Would you expect to be ahbad fdr a 
sho'n while, then behind for a short while, then aheafj'again, and so on? Or would you expect to be ahead fcir a 
very long time (ix\ a wifining streak) and then beh^id^for a long time (in a losing streak)? j 



The following pro-am simulates this game for 60 plays: ; 

READ VALUES INTO CI. PROBABILITIES INTO C2 * ' 

-1 .5 
1 .5 

^ DRANDOM 60 OBS., VALUES IN CI, PROB. IN C2, PUT IN C3 ; 
PARSUM C3, PUT PARTIAL SUMS IN C4 / 
GENERATE 60 INTEGERS INTO C5 

WIDTH 60 HEIGHT 20 FOR PLOT * \ 

PLOT C4 VS. C5 * i 

STOP ' 

In this program we first generated a sequence of sixty random + I's and -I's to indicate for eacl| toss whet 
you v^on or lost. They were put in C3. The command PARSUM calculated your total winnings pr losses after 
each loss. The last three commands plot your successive "winnings'* as the game progresses. T^i^'plotjis shojwn 
in L'xhibit 4. It*.s quite vivid from the plot that you were in a very long losing streak. In fact, this Iprt of pattern 
of long winning or losing streaks is quite typical. Usually you will spend most of the time iii longf winning 
streaks, or in long losing streaks. So it may not be 'Muck'* after all that gives gamblers their winning a|nd losing 
streaks. It may be just the laws of chance. 
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5, Ph!ici«cphy ond Uoo o? ilrllnllsb 

. Basic to lljc-dcvclopmcnt of Minitab is a strong belief that the system must be very easy to use, and at the 
same lime be a genuine statistical tool; By this we fnean that it should J)e useful for doinjg real statistics, not just 
* simple Wtbook problems. ^ / ^ ' 

The best evidence that Minitab is easy to use comes from an introductory statistics course given at Penn 
Statcv Each year, over 1,000* students take this coufse. Although most of these students have never used a 
computer before and have poor mathematical backgrounds, they are able to write, punch, and run Minitab 
programs after one hour of instruction. Except for this flrst sessidn, very little class time ii spent on Minitab, 
per se. As new statistical methods are introduced, the corresponding commands are mentione^.Typically 10 
prpblcms involving the computer are assigned each term. Many of the stuflents in this course Continue to use 
Minitab in subsequent courses and in research. 

Why is Minitab so easy to use? First its language is simple, natural, and consistent. To illustrate what we 
mean by natural, let's look at one small example. Consider the two commands ADD CI TO C2, PUT IN C3 
wand ADD 1 TO C2, PUT IN C3. It> clear that th« first command ac|di 2 columns together while the second 
add:; the number 1 to (nil entries in) a column. In Omnitab (and the oijginal Minitab) these 2 commands are 
written ADD 1 TO 2, PUT IN 3 and ADD l.O TO 2, PUT IN 3. Omnitab uses the presence or absence Of i 
decimalpoint to distinguish numbers from column numbers. One or two artificialities such as this are tolerable; 
Bat too many and students get annoyed and confused by the special rules and conventions they must remember.^ 

Second, Minitab's output has been designed (and redesigned) to be as clear, as concise, and as 
. pcdologically sound as possible. Consider the regression output in Exhibit 6* To a beginning student, the most 
important part of regression is the itff^%\oi\ equation, so this is what appears Jlrst. Having the equation in this 
' form also helps a student orient himself to the tables which follow. 

Third,^ Minitab keeps track of all bookkeeping autonlatically. For example, you don;t have to specify how 
many data cards there arc to be read. Minitab counts them for you. You can also freely mix different length 
columns. 

^Fourth, Minitab has default options, so that easy things are easy to do. If you want to plot column 1 vs. 
column 2, you can use the command PLOT CI VS. C2, and never worry about specifying scales. If however, 
you want to control the scales, or the size of your plot, you can use the more advanced plotting capabilities. 

And finally, Minitab has extensive error checking with good, clear, English diagnostics. All error messages 
come from the Minitab system and not from the computer's operating system. * " 

The best evidence that Minitab is a genuine statistical tool comes from its research users. For example, 
Minitab is heavily used by the Penn State Statistical Consulting Center — especially for regression analysis, 
and for analyzing smalt data sets. It is also used in industrial environments, e.g., at The Alcoa Technical Center 
where it's used primarily by experimenters who are not professional statisticians. Of course, Minitab is a 
limited tool — it docs not have any multivariate analysis, (no factor analysis, etc.), nor any time series, and it 
cannot handle extremely large data sets. But, within its scope, it is extremely useful. 

Qccaufic students know they are using a real tool, and not just a teaching device, they arc very enthusiastic 
about Minitab'and about the computer. It also appears that once students have "conquered" the computer with 
an casy-to-Use system such as Minitab, they can then go on to more sophisticated packages like SPSS, BMD, 
or Omnitab with very little difficulty. 

6. How Minitab Can Changt the Way you Teach — Ont Example 

Beginning students find it easier to appreciate statistical concepts if the concepts are extensively illustrated 
by examples, and particulary if the examples inv9lve good graphical techniques. Here we outline how 
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polynomial wgrcssion and data transformations can be explained to students )vho have studied only simple • 
regression and testing. 

The Example: The classical pendulum experiment fr.om physic* is designed to show the relationship 
between the length of a pendulum ;»nd its cycle time (or period). We can use the following data 15] to explore - * 
this relationship: . 



length (L) 


175.2 


151.5 


126.4 


101.7 


77.0 


time for one cycle (T) 


2,650 


2.468 


2.256 


2.024 


1.764 ■ 



A plot of T vs L, given in Exhibit 5, showa a very strong linear relationship, indicating that the equation 
T = Bo + B, L should provide a reasonable fit. The output from this fit is given in»Exhibit .6. (How much of 
■ this output should be explained to studenLi depends on how much statistics: they know). The results from this 
first fit look fairly good {high R', significant coefficients) but we shouldn't quit yet. 

At this point it is convenient to introduce residual plots as a powerful visual technique for examining lack 
of fit. One such plot is shown in Exhibit 7. This plot indicates that there was some curvativc in the original data. 
An obvious thing to try ntxt ii the quadratic. T= B. + B, L + B, L'. Ai far as students are coniccrnc^ 
fitting this equation is just an obvious extension of simple rcgrcttlon, and easy to do using the MmUaBP- 

program: ! , • , . 

READiLENpTH INTO CI. CYCLE TIME INTO C2 

(data caraS) ' ' • 

MULTIPLV^C! DY Cl. PUT INTO C3 
' REGRESS TIME IN C2, ON 2 PREDICTORS IN Cl. C3. STORE RESIDUALS IN 04' 
PLOT RESlbUALS IN C4 VS LENGTH IN Cl 

STOP I \ : < " / 

This quadratic cqu^ltion fits the data very well, with no lack of fit indicated by the residual plot. Of co'urse. we 
know from physics that the "true" relationship between L and T is. T = B.VL. We can also try fitting this 
modd. Again, the concept of data transformations is very natural, and easily explained to students. This fit is 
also very easy to do using the Minitab program: 

READ L INTO C. T INTO C2 - ' 

(data cards) 
SORT OF Cl, PUT INTO C3 

NOCONSTANT (IN FOLLOWING REGRESSION COMMAND, FIT EQUATION 

WITHOUT BO TERM) 
REGRESS T IN C2 ON I PRED. IN C3, STORE RES. IN C4 
PLOT RES. IN C4 VS LENGTH IN Cl 
STOP 

7. AvfiiabiUty and Implomontallon of th» Mlnltab Syittm 

Mihiiab is written in machine compatible FORTRAN IV and has been inplemcntcd oh a wide variety of 
computers, including IBM 360 models 30 and up, IBM 370, small and large CDC, UNIVAC, Burroughs, 
• Xerox, DECsystcm-10. It requires about 35K words of memory, and can be overlayed into less than half that 
space on many computers. It runs in cither batch or interactive mode. A copy of the system is available, either 
on tape or cards, at a nominal charge from the Penn State Statistics Dept. Documentation includes (1) a 90 ' 
page Reference Manual and (2) a Student Handbook (about 200 pages), which includes an introduction to ihc 
Minitab system, material on how to use Minitab for statistical analysis, examples and exercisfcs. 
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" — READ THE FOLLOWING GRADES INTO COLOMNS C1-C3 



COLUK.^ CI . C2 C3 

COUNT 5 5 ,5 

H ,^rf5. 32. 90. 

89. - 75. 82. 

95. ■ 85. N 72.. • 

100. 98. 95. 

^fl. 90^ 91. 



— ADD C1-C3, STORE SUMS IN COLUHN C4 

— DIVIDE COLUMN CU BY 3, PUT FINAL GRADES ,J«NT0 C5 

^- PRIHT THE FINAL' GRADES WHICH AKfi IN C5 ^ 

COLUMN C5 • ■ 

COUNT 5 - 

^5.6667 82.000i 84.0000 97.6667 93.0000 

— AVERAGE THE NUMBERS IN C5 
AjVERAGE = 88.467 

-- STOP 
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Exhibit 2 



BI.AD CA^lTOOIl' SCOHBS INTO Cl ftEALlSl'IC* SCOR&S^ I*»TO C2. 

SU&TSACT C2-FR0n C1,PUT DIFFERENCES INTO, C3. . * 
TTEST WITH MU=0 ON DIF^^ RENCES IN .C3 , 
C3 H = 3<* MBAH = . 1.0000 ST. DEV. = " 

TfeST OF MO a.- '0,0 VS. hj N.E." 0.0 

T:= aj.558 \ ■ 



"IIIE TEST IS sdlTtSTICALLY SIQ^tF-lCANT AT Al?P,HA = 0.0000 

' Ik . . , • , 



STOP 



Exhibit 3 



/ 



— PLOT REAUStlC IN C2, VS". OTIS IN*. C3 * 

2" , 



9.0+ 



♦ ♦ ' ♦ ♦ ♦ 



6.0+ 



3.0 + 



* * * 



0.0+ 
95. 



♦ 



V./ 



125. 



no; 



, — 4. 

lao. 




.0.0 



20.0 



10.0 



10.0 



30.0 



number oP tosses 

— — + 5 

60.0 

50.0 . 



Exhibit 5 



PLOT CYCLE TIME IN C2 VS LENGTH IN Cl i 

2 . ' 

2.80 + 



2.40 + 



2.00 + 



1.60+ 



+--■ 
75. 



165. 



120. 



1 

210, 



Exhibit 6 



THE hSaRBSSION EQUATION JS 
Y« 1.09 ♦ (0,900D-02) XI 



ON 1 PhKDICTUR IN CI 





X IN 


' Y IN 


ROW 


^ COL. 1 


COL. 2 


1 


175.200 


2.650 


2 


151.500 


2.U6d 


3 


126. UOO 


2.256 


U 


101.700 


2*02U 


5 


77.0Q0 


1.76«* 



PhfcD Y 


STD DHV 




/ STD.> 


VALUES 


PcvE& Y 


RFS. / 




2.672 


0.(519 


-0'.022/ 


-1.3fltt 


2.45^; 


O.OlU 


0.604 


0.UU3 


2.^233 


O.OM 


0.023 


1 . 039 


2.010 


0.01U 


O.J014 


U.651 


1.7BB 


O.019 


-0/624 


-lp520 



X COLUMN 
0 

1 1 

tH^E ESTIMATED ST. 

«JTH ( 5- 
R-SQUARED - 
R-SQOARED ' 



2) 



ESTIMATES FROM LEAST SOOAffES FIX 
COEFFICIENTS S.D.OF COEFF. 

. T.09U7 ^ 0.a21i)-01 

0.90036D-02 0. 32/1D-03 

DEV. OF Y ABOUT THd RRGRESSriON LINE IS 
3 DEGREES OF PREFDOM / 



RATIO 
26.00O 
28.03 



0.i)250 



99.6 PERCENT 

99.5 PERCENT { ADJUSTED FO£v D.^*-) 

ANALYSIS OF VARIANCE , 
DF ' S3 

TOTAL 5 ^ZS.al 

DUE TO .1EAN , 1 - 24.92 

TOTAL DUE TO REGRESSION 1 0.4914 

-RESIDUAL 3 0.ia76D-02 

X TRANSPOSE X INVERSE IN TLOWER TRIANGLE 

CORRELATIONS OF ESTIMATES OF COEFFICIENTS IN UPPER TRIANGLE 
♦ 1- 9 

. 2.B3397 -0.96 

1 -0.2 08450D-C1 0. 164965D-03 



«S»SS/DF 



0.4914 
0.6255D-03 



exhibit 7 



PLOT B^fiSIDUALS^'IN C3 VS LENGTH IN CI 

3 ^ . 



1 



1.00* 



0.0 ♦ 



-i:oo* 



i' 



-2.00+ 



f — 
■ 75. 



f 1 



165. 



120. 



210. 
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